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The research earned out in ihe subject application was supported in part by grants 
from the National Institutes of Health. The government may have rights in any patent issuing 
on this application. 

INTRODUCTION 

Field of the- Invention 

The field of this invention is transcription factor proteins involved in vascularization. 

Background 

Roughly a dozen proteins classified as basic helix-loop-helix/PAS domain 
transcription factors have been described in both vertebrates and invertebrates. Members of 
this class derive their name from the shared presence of a basic helix-loop-helix (bHLH) 
motif that specifies sequence dependent recognition of DNA and a PAS domain composed of 
two-imperfect repeats. PAS is an acronym derived from the first three proteins observed to 
contain this motif. These include the product of the^norf gene of Drosoplula melanogasier 
(Jackson et al. 1986; Citri et al. 1987), the aryl hydrocarbon nuclear transporter gene (ARNT) 
of mammals (Burbach et al. 1992), and the product of the fruit fly single-minded gene 
(Nambu et al. 1991). 

The imperfect, direct repeats within the PAS domain are approximately 50 amino 
acids in leneth and contain a signature His-X-X-Asp sequence in each repeat. Three 
biochemical functions have been assigned to the PAS domain. First, it acts in concert with 
the helix-loop-helix domain of bHLH/PAS proteins to form a dimerization surface (Reisz- 
Porszasz et al. 1994; Fukunagaet al. 1995; Lindebro et al. 1995). In the case of the period 
»ene product, which lacks a bHLH domain, the PAS domain specifics heterodimenzation 
with the product of the tuneless locus (Gekakis et al. 1995 ; Myers et al. 1995). Interaction 
between ihc period and timeless gene products represents a crucial event in the control of 
circadian rhythm in fruit flics (Hunter-Ensor et al. 1996; Lee et al. 1996; Myers et al. 1996; 
Zen* et al 1996). In contrast, the aryl hydrocarbon receptor (AHR) heterodimenzes with 
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ARNT via PAS domain interactions (Fukunaga et al. 1995), producing a heterodimer that is 
competent for nuclear gene interaction. Second, the PAS domain mediates interaction with 
heat shock protein 90 (HSP-90). Several PAS domain proteins, including the single-minded 
gene product and the AHR, can be sequestered in the cytoplasm in an inactive state. 
Maintenance of the inactive state involves interactions between the PAS domain and HSP-90 

5 (Perdew, 1988; Chen and Perdew, 1994; Henry and Gasiewicz, 1993; McGuire et al. 1995). 
Finally, the PAS domain of the AHR facilitates high affinity binding of certain xenobiotic 
compounds including dioxin (reviewed in Hankinson, 1995; Schmidt and BradfiekL 1996). 

PAS domain transcription factors perform diverse functions in a variety of cell types 
and organisms. The period gene product helps regulate circadian rhythm in fruit flies 

10 (Konopka and Benzer. 1971), whereas the mammalian AHR provides response to xenobiotics 
by activating genes whose products facilitate detoxification (Schmidt and Bradfield, 1996). A 
more recently described member of the PAS domain family, hypoxia inducible factor (HIF- 
lct), activates genes whose products regulate hematopoiesis in response to oxygen deprivation 
(Wang et al. 1995). In Drosophila, the single-minded gene product affects neurogenesis 

15 (Nambu et al. 1991) and the trachealess gene product controls the formation of tubular 
structures in the embryo (Wilk et al. 1996; Isaac and Andrew, 1996). 

The utilization of bHLH/PAS domain proteins in diverse species and physiological 
processes raises the possibility that this family of transcription factors might consist of many 
undiscovered members. Here we report the initial characterization of new members of this 

20 protein family collectively designated endothelial PAS domain protein 1 (EPAS1). 

SUMMARY OF THE INVENTION 
The invention provides methods and compositions relating to endothelial PAS domain 
protein 1 (EPASi), related nucleic acids, and protein domains thereof having EPAS1 -specific 

25 activity. EPAS1 proteins can regulate specification of endothelial tissue, such as vasculature, 
the blood brain barrier, etc. The proteins may be produced recombinantly from transformed 
host cells from the subject EPAS1 encoding nucleic acids or purified from mammalian cells. 
The invention provides isolated EPAS1 hybridization probes and primers capable of 
specifically hybridizing with the disclosed EPAS1 gene, EPASl-specific binding agents such 

30 as specific antibodies, and methods of making and using the subject compositions in 

diagnosis (e.g. genetic hybridization screens for EPAS1 transcripts), therapy (e.g. gene 
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therapy to modulate EPAS1 gene expression) and in the biopharmaceutical industry (e.g. as 
immunogens. reagents for isolating B-cell specific activators or other transcriptional 
regulators, reagents for screening chemical libraries for lead pharmacological agents, etc.). 

SEQ ID NO: LISTING 
5 SEQ ID NO: 1 : human EPAS 1 cDNA. 

SEQ ID NO:2: murine EPAS1 cDNA. 

SEQ ID NO:3: HIF-l a binding site. 

SEQ ID NO:4: human EPAS1 protein. 

SEQ ID NO:5: murine EPAS1 protein. 
10 SEQ ID NO:6: human HIF-laprotein. 

SEQ ID NO:7: murine HIF-la protein 

DETAILED DESCRIPTION OF THE INVENTION 
The nucleotide sequence of a natural cDN A encoding a human and murine EPAS I 
15 proteins are shown as SEQ ID NOS:l and 2, respectively, and the full conceptual translates as 
SEQ ID NOS:4 and 5, respectively. The EPAS1 proteins of the invention include incomplete 
translates of SEQ ID NOS:l and 2 and deletion mutants of SEQ ID NOS:4 and 5, which 
translates and deletion mutants have EPAS 1 -specific amino acid sequence and binding 
specificity or function. Such active EPAS I deletion mutants, EPAS1 peptides or protein 
20 domains comprise at least 14, preferably at least about 16. more preferably at least about 20 
consecutive residues of SEQ ID NO:4 or 5. For examples, EPAS1 protein domains identified 
below are shown to provide dimerization, protein-binding, and nucleic acid binding function. 
Additional such domains are identified in and find use, inter alia, in solid-phase binding 
assays as described below. 
25 EPAS 1 -specific activity or function may be determined by convenient in vitro, cell- 

based, or in vivo assays: e.g. in vitro binding assays, cell culture assays, in animals (e.g. 
immune response, gene therapy, transgenics, etc.), etc. Binding assays encompass any assay 
where the molecular interaction of an EPAS1 protein with a binding target is evaluated. The 
binding target may be a natural intracellular binding target such as another bHLH/PAS 
30 protein, a heat shock protein, or a nucleic acid sequence/binding site or other regulator that 
directly modulates EPAS1 activity or its localization; or non-natural binding target such a 
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specific immune protein such as an antibody, or an EPAS1 specific agent such as those 
identified in screening assays such as described below. EPASl -binding specificity may 
assayed by binding equilibrium constants (usually at least about 10 7 M*\ preferably at least 
about 10 s M"', more preferably at least about 10 9 M" 1 ), by the ability of the subject protein to 
function as negative mutants in EP AS 1 -expressing cells, to elicit EPASl specific antibody in 
a heterologous host (e.g a rodent or rabbit), etc. In any event, the EPASl binding specificity 
of the subject EPASl proteins necessarily distinguishes HIF-la. 

The claimed EPASl proteins are isolated or pure: an "isolated" protein is 
unaccompanied by at least some of the material with which it is associated in its natural state, 
preferably constituting at least about 0.5%, and more preferably at least about 5% by weight 
of the total protein in a given sample and a pure protein constitutes at least about 90%, and 
preferably at least about 99% by weight of the total protein in a given sample. The EPASl 
proteins and protein domains may be synthesized, produced by recombinant technology, or 
purified from mammalian, preferably human cells. A wide variety of molecular and 
biochemical methods are available for biochemical synthesis, moiecuiar expression and 
purification of the subject compositions, see e.g. Molecular Cloning, A Laboratory Manual 
(Sambrook, et ai Cold Spring Harbor Laboratory), Current Protocols in Molecular Biology 
(Eds. Ausubel, et a/., Greene Publ. Assoc., Wiley-Interscience, NY) or that are otherwise 
known in the art. 

The invention provides natural and non-natural EPASl -specific binding agents, 
methods of identifying and making such agents, and their use in diagnosis, therapy and 
pharmaceutical development. For example, EPASl -specific agents are useful in a variety of 
diagnostic and therapeutic applications. Novel EPASl -specific binding agents include 
EPASl -specific receptors, such as somatically recombined protein receptors like specific 
antibodies or T-cell antigen receptors (see, e.g Harlow and Lane (1988) Antibodies, A 
Laboratory Manual. Cold Spring Harbor Laboratory) and other natural intracellular binding 
agents identified with assays such as one-, two- and three-hybrid screens, non-natural 
intracellular binding agents identified in screens of chemical libraries such as described 
below, etc. For diagnostic uses, the binding agents are frequently labeled, such as with 
fluorescent, radioactive, chemiluminescent, or other easily detectable molecules, either 
conjugated directly to the binding agent or conjugated to a probe specific for the binding 
agent. Agents of particular interest modulate EPASl function, e.g. EPASl -dependent 
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transcriptional activation: tor example, isolated cells, whole tissues, or individuals may be 
treated with an EPASl binding agent to activate, inhibit, or alter EPASl -dependent 
transcriptional processes. 

The amino acid sequences of the disclosed EPASl proteins are used to back-translate 
EPASl protein-encoding nucleic acids optimized for selected expression systems (Holler et 
5 al.(l993)Gene 136. 323-328: Martin et al. (1995) Gene 154, 150-166) or used to generate 
degenerate oligonucleotide primers and probes for use in the isolation of natural EPASl - 
encoding nucleic acid sequences ("GCG" software. Genetics Computer Group, Inc, Madison 
WD. EPASl -encoding nucleic acids used in EP AS 1 -expression vectors and incorporated into 
recombinant host cells, e.g. for expression and screening, transgenic animals, e.g. for 
10 functional studies such as the efficacy of candidate drugs for disease associated with EPASl - 
modulated transcription, etc. 

The invention also provides nucleic acid hybridization probes and replication / 
amplification primers having a EPASl cDNA specific sequence contained in SEQ ID NO.l 
and sufficient to effect specific hybridization thereto (i.e. specifically hybridize with SEQ ID 
15 NO-.l in the presence of endothelial cell cDNA). Such primers or probes are at least 12, 
preferably at least 24. more preferably at least 36 and most preferably at least 96 bases in 
length. Demonstrating specific hybridization generally requires stringent conditions, for 
example, hybridizing in a buffer composing 30% formamide in 5 x SSPE (0.18 M NaCl. 0.01 
M NaP0 4 , pH7.7, 0.001 M EDTA) buffer at a temperature of 42°C and remaining bound 
20 when subject to washing at 42°C with 0.2 x SSPE: preferably hybridizing in a buffer 

comprising 50% formamide in 5 x SSPE buffer at a temperature of 42°C and remaining, 
bound when subject to washing at 42°C with 0.2 x SSPE buffer at 42°C. EPASl cDNA 
homologs can also be distinguished from other protein using alignment algorithms, such as 
BLASTX (Altschul et al. (1990) Basic Local Alignment Search Tool, J Mol Biol 215, 403- 
25 410). 

The subject nucleic acids are of synthetic/non-natural sequences and/or are isolated, 
i.e. unaccompanied by at least some of the material with which it is associated in its natural 
state, preferably constituting at least about 0.5%, preferably at least about 5% by weight of 
total nucleic acid present in a g.ven fraction, and usually recombinant, meaning they comprise 
30 a non-natural sequence or a natural sequence joined to nucleotide(s) other than that which it is 
joined to on a natural chromosome. Nucleic acids comprising the nucleotide sequence of 
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SEQ ID NO: 1 or 2 or fragments thereof, contain such sequence or fragment at a terminus, 
immediately flanked by a sequence other than that which it is joined to on a natural 
chromosome, or flanked by a native flanking region fewer than 10 kb, preferably fewer than 2 
kb, which is at a terminus or is immediately flanked by a sequence other than that which it is 
joined to on a natural chromosome. While the nucleic acids are usually RNA or DNA, it is 
5 often advantageous to use nucleic acids comprising other bases or nucleotide analogs to 
provide modified stability, etc. 

The subject nucleic acids find a wide variety of applications including use as 
translatable transcripts, hybridization probes, PCR primers, diagnostic nucleic acids, etc.; use 
in detecting the presence of EPAS 1 genes and gene transcripts and in detecting or amplifying 

1 0 nucleic acids encoding additional EPAS1 homologs and structural analogs. In diagnosis, 

EPAS1 hybridization probes find use in identifying wild-type and mutant EPAS1 alleles in 
clinical and laboratory samples. Mutant alleles are used to generate allele-specific 
oligonucleotide (ASO) probes for high-throughput clinical diagnoses. In therapy, 
therapeutic EPAS1 nucleic acids are used to modulate cellular expression or intracellular 

15 concentration or availability of active EPAS1. 

The invention provides efficient methods of identifying agents, compounds or lead 
compounds for agents active at the level of a EPAS1 modulatable cellular function. J 
Generally, these screening methods involve assaying for compounds which modulate EPAS1 
interaction with a natural EPAS1 binding target. A wide variety of assays for binding agents 

20 are provided including labeled in vitro protein-protein binding assays, immunoassays, cell 
based assays, etc. The methods are amenable to automated, cost-effective high throughput 

screening of chemical libraries for lead comoounds. Identified reagents find use in the 

— - t _ 

pharmaceutical industries for animal and human trials; for example, the reagents may be 
derivatized and rescreened in in vitro and in vivo assays to optimize activity and minimize 

25 toxicity for pharmaceutical development. Target indications include neoproliferative disease, 
inflammation, hypersensitivity, wound healing, immune deficiencies, infection etc. 

In vitro binding assays employ a mixture of components including an EPAS1 protein, 
which may be part of a fusion product with another peptide or polypeptide, e.g. a tag for 
detection or anchoring, etc. The assay mixtures comprise a natural intracellular EPAS1 

30 binding target. While native binding targets may be used, it is frequently preferred to use 

portions (e.g. peptides) thereof so long as the portion provides binding affinity and avidity to 

6 
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the subject EPAS 1 protein conveniently measurable in the assay. The assay mixture also 
comprises a candidate pharmacological agent. Candidate agents encompass numerous 
chemical classes, though typically they are organic compounds; preferably small organic 
compounds and are obtained from a wide variety of sources including libraries of synthetic or 
natural compounds. A variety of other reagents may also be included in the mixture. These 

5 include reagents like salts, buffers, neutral proteins, e.g. albumin, detergents, protease 
inhibitors, nuclease inhibitors, antimicrobial agents, etc. may be used. 

The resultant mixture is incubated under conditions whereby, but for the presence of 
the candidate pharmacological agent, the EPAS 1 protein specifically binds the cellular 
binding target, portion or analog with a reference binding affinity. The mixture components 

1 0 can be added in any order that provides for the requisite bindings and incubations may be 
performed at any temperature which facilitates optimal binding. Incubation periods are 
likewise selected for optimal binding but also minimized to facilitate rapid, high-throughput 
screening. 

After incubation, the agent-biased binding between the EPAS1 protein and one or 

1 5 more binding targets is detected by any convenient way. For cell-free binding type assays, a 
separation step is often used to separate bound from unbound components. Separation may 
be effected by precipitation (e.g. TCA precipitation, immunoprecipitation, etc.), 
immobilization (e.g on a solid substrate), etc., followed by washing by, for examples, 
membrane filtration (e.g. Whatman's P-81 ion exchange paper, Polyfiltronic's hydrophobic 

20 GFC membrane, etc.), gel chromatography (e.g. gel filtration, affinity, etc.). For EPAS1- 
dependent transcription assays, binding is detected by a change in the expression of an 
EPAS 1 -dependent reporter. 

Detection may be effected in any convenient way. For cell-free binding assays, one of 
the components usually comprises or is coupled to a label. The label may provide for direct 

25 detection as radioactivity, luminescence, optical or electron density, etc. or indirect detection 
such as an epitope tag, an enzyme, etc. A variety of methods may be used to detect the label 
depending on the nature of the label and other assay components, e.g. through optical or 
electron density, radiative emissions, nonradiative energy transfers, etc. or indirectly detected 
with antibody conjugates, etc. 

30 A difference in the binding affinity of the EPAS 1 protein to the target in the absence 

of the agent as compared with the binding affinity in the presence of the agent indicates that 



BNSDOCID: <WO 9831701A1J_> 



WO 98/31701 PCT/US98/00813 

the agent modulates the binding of the EPAS1 protein to the EPAS1 binding target. 
.Analogously, in the cell-based transcription assay also described below, a difference in the 
EPAS1 transcriptional induction in the presence and absence of an agent indicates the agent 
modulates EPAS1 -induced transcription. A difference, as used herein, is statistically 
significant and preferably represents at least a 50%. more preferably at least a 90% difference. 
5 The following experimental section and examples are offered by way of illustration 

and not by way of limitation. 

EXPERIMENTAL 

cDNAs encompassing the coding region of the human EPASl were isolated by screening 
a HeLa cell cDNA library with a radiolabeled probe derived from an expressed sequence tag 
1 0 (#T70415) obtained from the Genbank data base (see Materials and Methods). Multiple cDNA 
clones were isolated and subjected to DNA sequence analysis to derive the conceptually 
translated protein sequence of human EPAS 1 shown in Table 1 . The predicted M r of the human 
EPASl was 96,528. A termination codon was located 24 nucleotides 5' of the designated 

initiator methionine in the human sequence, clunz-ys wiwu« 16 »»w 

1 5 isolated from an adult mouse brain cDNA library using a probe obtained by reverse transcriptase 
polymerase chain reactions with oligonucleotide primers derived from the human EPAS 1 cDNA 
sequence (see Materials and Methods). The predicted protein sequence of murine EPASl is 
aligned and compared with the human sequence in Table 1. The two proteins share 88% 
sequence identity. Data base searches revealed that the human and murine EPASl proteins share 
20 extensive primary amino acid sequence identity with hypoxia inducible factor-la (HIF-la), a 
member of the bHLH/PAS domain family of transcription factors (Wang et al. 1995; Wenger et 
al. 1995). EPASl and HIF-la share 48% primary amino acid sequence identity as revealed by 
the alignment shown in Table 1 . Sequence conservation between the two proteins is highest in 
the basic-helix-loop-helix (85%), PAS A (68%) and PAS-B (73%) regions. A second region of 
sequence identity occurs at the extreme carboxy terminis of the EPASl and HIF-la proteins. 
This conserved region in mHIFla has been recently shown to contain a hypoxia response domain 
(Li et al.. 1996). EPASl also shares sequence relatedness with other PAS domain proteins, 
however the degree of similarity between EPASl and other family members is less striking than 
that between HIF-la and EPASl . 
30 Genomic clones encoding the human EPASl transcript were isolated by screening 

bacteriophage libraries of human DNA. The intron-exon structure of the gene was established 



25 
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by comparison of DNA sequences obtained from the genomic DNA to that of the cDNA. The 
coding region of EPAS1 is specified by 15 exons. The exonic sequences mapped to six 
non-overlapping bacteriophage lambda clones whose average insert size was 20 kb. indicating 
that the EPAS1 gene spans at least 120 kb of genomic DNA. A comparison of the EPAS1 gene 
structure with that of the aryl hydrocarbon receptor (Schmidt et al. 1993) reveals that the 
positions of introns within the regions encoding the amino-terminal halves of the two proteins 
are highly conserved. In contrast, the portion of the EPAS 1 gene specifying the carboxy-terminal 
half of the protein is interrupted by seven introns, whereas the AHR gene contains only a single 
intron in this region. Thus the 5'-ends of the two genes may have arisen from an ancient gene 
duplication event, whereas the 3'-regions have a more recent evolutionary origin. 

Two methods were used to determine the chromosomal location of the human EPAS1 
gene. Fluorescent in situ hybridization (FISH) analysis was performed using a biotinylated probe 
containing exons S- 14 of the EPAS1 gene. This analysis revealed a single hybridization signal 
over chromosome 2, bands p 1 6-p2 1 . As a second assay for gene localization, an oligonucleotide 
primer pair derived from exon 8 was used to amplify a segment of the EPAS1 gene from the 
1 5 genomic DNAs of a radiation hybrid panel. Computer-assisted analysis of the results indicated 
linkage of the EPAS1 gene to the D2S288 marker on chromosome 2p with a LOD score of 8.7 
and a cR8000 value of 12.96. Thus, the data obtained from two independent mapping methods 
consistently positioned the EPAS1 gene on the short arm of chromosome 2 and indicate that the 
EPAS1 gene is non-syntenic with the HIF-la gene, which maps to chromosome 14q21-24 

20 (Semenza et al. 1996). 

The high degree of sequence similarity between the EPAS1 and HIF-la proteins raises 
the possibility that they share a common physiological function. To test this hypothesis, RNA 
blotting experiments were used to compare and contrast the distributions of EPAS 1 and HIF-la 
mRNAs in a variety of human tissues. An EPAS1 mRNA of approximately 5.8 kb was detected 

25 in all tissues examined with the single exception of peripheral blood leukocytes. Among the 
positive tissues, highly vascularized organs such as the heart, placenta and lung showed the 
hiehest levels of EPAS 1 mRNA. A HIF-la mRNA of approximately 4.4 kb was detected in all 
human tissues. In contrast to EPAS1 mRNA, however, peripheral blood leukocytes contained 
very high levels of HIF-la mRNA. Likewise, we observed no enrichment of HIF-la mRNA in 

30 highly vascularized tissues. 

These RNA blotting data indicate that, with few exceptions, most tissues express both 
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EPAS1 and HIF-lu mRNAs. To determine if tins overlap extended to the cellular level, in situ 
niRNA hybridization was used to determine the cell type specific expression patterns of the two 
gene products. Sections from day 1 1 and day 13 mouse embryos were examined first. In day 
1 1 embryo sections, EPASl transcripts were observed almost exclusively in endothelial cells of 
the intersegmental blood vessels separating the somites, the atrial and ventricular chambers of 
the heart, and the dorsal aorta. Extra-embryonic membranes, such as the yolk sac, which are 
hishly vascularized, also expressed abundant levels of EPASl mRNA. In the developing brain 
of a day 13 embryo, endothelial cells of the highly vascularized choroid plexus contained 
abundant EPASl transcripts. The brain section also revealed intense EPASl mRNA 
hybridization in the endothelial cells of a blood vessel lying along the edge of post-mitotic 
neurons emanating from the lateral ventricle region. When a nearby section was hybridized with 
an anti-sense probe that was specific for the HIF-la mRNA, only a diffuse signal somewhat over 
background was detected, indicating a low level of HIF-la expression in many cell types. In 
contrast to the results with the EPASl probe, no concentration of HIF-la mRNA was detected 
in the endothelial ceils of the adjacent blood vessel. A differential expression pattern between 
EPASl and HIF-la was also apparent in the region of the embryo containing the umbilicus. 
EPASl transcripts were detected in the endothelium of blood vessels within this structure, 
whereas HIF-la mRNA was concentrated in the mesenchyme surrounding the vascular 
endothelium. 

In tissues of adult mice, EPASl mRNA was also detected at high levels in endothelial 
cells, yet was also present at lower levels in several additional cells types. For example, decidual 
cells of the placenta contained very high levels of EPASl mRNA as did parenchymal tissue in 
the lung. The distinction between EPASl expressing cell types and HIF-la expressing cells was 
also apparent in adult tissues. A section through the cortex of the kidney showed EPASl 
expression in the mesangial cells. In contrast, HIF-la expression was found in the cells of the 
collecting ducts. Taken together, these in situ mRNA hybridization results reveal very divergent 
patterns of EPASl and HIF-la mRNA distribution. 

The presence of basic helix-loop-helix and PAS domain motifs in EPASl raised the 
possibility that this protein might be capable of forming a complex with the aryl hydrocarbon 
receptor nuclear transport protein (ARNT) (Hoffman et al. 1991), and that the resulting 
heterodimer might exhibit sequence-specific DNA binding. To test these predictions, EPAS 1 and 
ARNT expression vectors were used to program a reticulocyte lysate. The EPASl expression 
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vector was modified at its carboxy-terminus with a c-Myc epitope tag to facilitate immunological 
detection of the EPAS1 translation product. Radiolabeled methionine was included in the 
translation mix containing the ARNT mRNA, whereas unlabeled methionine was used in the 
EPAS1 reaction. After translation, the two reactions were mixed and subsequently incubated 
with a monoclonal antibody that recognizes the c-Myc epitope present on the EPAS1 protein. 
5 Under these conditions the c-Myc antibody was capable of immunoprecipitatmg the radiolabeled 
ARNT protein only when EPASl-Myc protein was present in the reaction. 

The bHLH domains of HIF-la and EPAS1 are nearly identical in primary amino acid 
sequence. Thus, to test for the ability of EPAS1 to form a functional heterodimer with ARNT, 
we used a HIF-la response element derived from the 3'-flanking region of the erythropoietin 
10 gene (Semenza and Wang, 1992) in gel mobility shift assays with in vitro translated proteins. 
The data showed that a new complex was formed when both EPAS1 and ARNT were included 
in the DNA binding reaction, and that this complex was specifically recognized by an 
anti-peptide antibody directed against the EPAS1 protein. Competition experiments using a 
100-fold excess of unlabeled competitor DNA containing the HIF-la response element, or a 
1 5 response element with three point mutations in this sequence, indicated that EPAS 1 exhibited 
sequence-specific binding properties. Taken together, the data indicate that EPAS1 is capable 
of binding the HIF-la response element in the presence of the ARNT protein. 

The ability of EPAS 1 to trans-activate a reporter gene containing the HIF-la response 
element was tested by transient transfection. Expression vectors in which either EPAS1, HIF-la, 
.0 or ARNT were placed under the control of a cytomegalovirus promoter were constructed. Two 
luciferase reporter constructs were prepared. One contained nucleotides -105 through -58 of the 
herpes simplex virus thymidine kinase promoter (McKnight et al. 1981) linked to three coptes 
of the HIF-la response element from the erythropoietin gene (pRE-tk-LUC). The other 
contained a TATA sequence from the adenovirus major late gene promoter (Lillie and Green, 
25 1989) linked to the same three HIF-la response elements (pElB-LUC). Combinations of these 
plasmids were then transfected into cultured human embryonic kidney 293 cells and the 
expression of luciferase enzyme activity was monitored in cell lysates 16-20 hours 
post-transfection. The data showed that EPAS1 induced a 12-fold increase in luciferase enzyme 
activity when transfected in the absence of the ARNT vector. Cotransfection of the ARNT 
30 expression vector with low levels of EPAS1 expression vector did not increase the 
EPASl-mediated induction of luciferase activity, suggesting that this cell line might contain 
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adequate amounts of endogenous ARNT 10 support heterodimer formation with EPAS1. A 
seven-fold stimulation of luciferase activity was also obtained when larger amounts of the HIF- 
la expression plasmid were introduced into 293 cells . The introduction of three point mutations 
into the core sequence of the hypoxia response element eliminated both EP AS 1 -dependent and 
HIF- la-dependent activation of the reporter gene. 
5 The potential of HIF- la to induce expression of target genes is increased by both hypoxia 

and pharmacological compounds that mimic hypoxia in cells, such as desferrioxamine (DFX) and 
cobalt chloride (CoCK) (Wang et al. 1995). To determine if EPAS1 activity might also be 
stimulated by these agents, 293 cells were incubated under hypoxic conditions or treated with 
DFX or CoCK prior to transfection with the plasmids. Pretreatment of cells under conditions that 
10 mimic hypoxia increased expression from the luciferase construct in the absence of exogenous 
EPAS1 or HIF- la. This trans-activation presumably arises from endogenous HIF- la or EPAS1 
proteins whose mRNAs are present in 293 cells. As noted above, introduction of the EPAS1 
expression vector led to 5- to 10 times higher levels of luciferase activity over those seen in 
mock-transfected ceiis. An extra 2 to 4-foid stimulation of luciferase expression was observed 
15 upon pretreatment with CoCK, DF, or hypoxia relative to that measured in EPAS1- transfected 
but untreated cells. Of the three conditions, pretreatment with CoCK led to a slightly larger 
increase in EPAS1 activity, resulting in a four- fold higher level of luciferase activity over that 
detected in untreated cells. As has been observed in previous studies (Jiang et al. 1996; Forsythe 
et al. 1996), hypoxic conditions also stimulated the ability of HIF- la to trans-activate the target 
20 gene containing the hypoxia response element. 

The EPAS1 expression vector was also tested for its ability to activate a reporter gene 
(pRE-Elb-LUC) following transfection into murine hepatoma cells (Hepalclc7) that express 
ARNT, as well as in a mutant line derived from these parental cells that does not express ARNT 
(c4 variant, Legraverend et al. 1982). Expression of EPAS1 in the Hepalclc7 cells led to a 
25 nine-fold increase in luciferase activity. Transfection of EPAS1 alone into c4 cells increased 
luciferase enzyme activity only slightly (K8-fold) whereas cotransfection of EPAS1 and ARNT 
led to a 12-fold stimulation of activity. These findings are consistent with the interpretation that 
EPAS1 forms an active heterodimeric transcription factor with ARNT, and they confirm the 
results showing heterodimerization of these two proteins obtained in coimmunoprecipitation and 
30 gel mobility shift assays. 

The experiments demonstrating the functional activity of EPAS1 utilized a hypoxia 
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response element derived from the erythropoietin gene, which is a known target gene for HIF-lct 
(Semenza and Wang, 1992). Despite the activity of EPAS1 in these assays, as well as the high 
degree of sequence similarity between HIF-la and EPAS1. the in situ mRNA hybridization 
results indicate that the two proteins are expressed in different cell types and thus might activate 
different target genes. The high level of expression of EPAS1 in endothelial cells raises the 
5 possibility that the EPAS1 protein might activate genes whose expression is limited to 
endothelial cells. To test this hypothesis, we transfected 293 cells with a c-Myc-tagged EPAS1 
express.on vector and a marker gene composed of the S'-flanking region of the Tie-2 gene linked 
to p-galactosidase. Tie-2 encodes a tyrosine kinase receptor that is specifically expressed in cells 
of endothelial lineage (Dumont et al. 1992; Maison-Pierre et al. 1993; Sato et al. 1993; Schnurch 
10 and Risau, 1993). The data showed that EPAS1 potently stimulated expression of the 
r/e-2-driven reporter gene, and that the degree of stimulation correlated with the level of 
immunodetectable EPAS1 in the transfected cells. Surprisingly, little or no transcriptional 
activation of the Tie-2 reporter gene by HIF-la was detected, even though equivalent amounts 
of HIF-la and EPAS1 proteins were expressed in the 293 cells. 
1 5 These data reveal 'that EPAS 1 proteins and nucleic acids provide reagents to modulate the 

formation of the endothelial tissues including vasculature, the blood brain barrier, etc. and to 
modulate cellular or tissue responsiveness to oxygenation, hypoxia and other hemodynamic 
stimuli. 

cDNA and genomic cloning, chromosomal mapping 

20 in the course of screening for genes that are differentially expressed in prostate 

adenocarcinoma versus normal tissue, a cDNA encoding a bHLH/PAS domain protein was 
isolated. Data base searches generated several expressed-sequence tags that showed sequence 
similarity to this family of transcription factors. EPAS1 cDNAs correspond to the human 
expressed sequence tag #T70415 in the Genbank collection and were isolated by a combination 

25 of reverse transcriptase polymerase chain reactions and screening of a HeLa cell cDNA library 
(Yokoyama et al. 1993) using standard methods. Similar approaches were used to isolate the 
murine homologue from a commercially available mouse adult brain cDNA library (#837314, 
Stratagene Corp., La Jolla, CA). A human HIF-la cDNA was generated by ligation of an 
amplified cDNA fragment to expressed sequence tag hbc025 (Takeda et al. 1 993). Bacteriophage 

30 A. clones harboring genomic DN A inserts corresponding to the human EPAS 1 gene were isolated 
by screening a commercially available fibroblast genomic library (XFIXII vector, #946204, 
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Stratagene Corp.) 

Fluorescence in situ hybridization to identify the chromosomal localization of the human 
EPAS1 gene was carried out as previously described (Craig and Bickmore, 1994). This analysis 
indicated hybridization to the short arm of chromosome 2, bands pi 6-21. To confirm the 
assignment, a 269 bp segment of exon 8 from the EPAS1 gene was amplified from the 83 
5 genomic DNAs of a radiation hybrid panel (Stanford G3 panel Research Genetics, Huntsville. 
AL) using oligonucleotide primers and a thermocycler program consisting of 35 cycles of 94°C/t 
min, 68°C/1 min. Analysis of the results via an e-mail server at Stanford University indicated 
linkage to the D2S288 marker (logarithm of the odds score of 8.7, cR_8000 value of 12.96), 
which is located approximately 82 centimorgans from the telomere of the short arm of 
10 chromosome 2 (MIT Center for Genome Research). 
RNA blotting and in situ hybridization 

Human multiple tissue RNA blots (Clontech Laboratories, Palo Alto, CA) were probed 
with EPAS1 and HIF-la cDNA probes using Rapid-Hyb from Amersham Corp. (Arlington 
Heights, IL). For in situ mRNA hybridization, mouse tissues were fixed in 4% 
15 paraformaldehyde, sectioned at 5 \im thickness, and subjected to in situ mRNA hybridization as 
described (Berman et al. 1995). A [ 33 P]-labeled antisense RNA probe recognizing the EPAS1 
mRNA was derived by in vitro transcription of an -300 bp DNA fragment encoding amino acids 
225-327 of the sequence shown in Table 1. A segment of the murine HIF-la cDNA encoding 
amino acids 41-125 was isolated by reverse transcriptase-polymerase chain reactions using 
20 mRNA template isolated from embryonic day 10 mouse embryo. 
Co-immunoprecipitation experiments 

Human EPAS1 and mouse ARNT proteins were generated in vitro using a 
transcription-translation kit (TNT System," Promega Corp., Madison, WI). cDNAs encoding full- 
length proteins were subcloned into the pcDNA3 vector (Invitrogen Corp., San Diego, CA) prior 
25 to coupled transcription/translation. For immunoprecipitation, approximately 5 \i\ of each 
reaction were transferred to a separate tube, mixed well and subsequently diluted by the addition 
of 500 ul of ice-cold buffer (20 mM Hepes-KOH, pH 7.4/ 100 mM KC1/ 10% (v/v) glycerol/ 
0.4% (v/v) Nonidet P-40/ 5 mM EGTA/ 5 mM EDTA/ 100 ng/ml bovine serum albumin/ 1 mM 
dithiothreitol) (Huang et al. 1993). The diluted mixture was incubated with 1 jil (0.1 fig) of 
30 anti-Myc monoclonal antibody 9E10 (Santa Cruz Biotechnology, Santa Cruz, CA) for 2 hours 
at 4 C C. A 10 \il aliquot of beads (-4 x 10 6 in number, Dynal Corp., Lake Success, NY) coated 
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with rat anti-mouse IgGl antibody were then added followed by a further incubation for 1 hour 
at 4°C. Beads were washed three times with 1 .5 ml of the above buffer and bound proteins were 
subsequently analyzed by electrophoresis through 8% polyacrylamide gels containing SDS. 

Gel retention assays 

EPAS1 and ARNT cDNAs were translated in vitro as described above. Gel retention 

5 assays were performed as described previously (Semenza and Wang, 1992) using a 
double-stranded oligonucleotide probe radiolabeled with the Klenow fragment of £. coli DNA 
polymerase 1 and containing an HIF-la binding site (5 f -GCCCTACGTGCTGTCTCA-3\ SEQ 
ID NO:3) from the erythropoietin gene (Semenza and Wang, 1992). For supershift assays, a 
polyclonal antibody was raised against residues 1 to 10 of the human EPAS1 protein by standard 

10 methods and 1 ul of serum was added to the gel retention reaction mixture prior to the 30 minute 
incubation at 4°C. A preimmune serum served as a negative antibody control. 
Transient transfection assays 

The pTK-RE3-luc reporter plasmid was constructed by inserting three copies of a 
50-nucleotide hypoxia-inducible enhancer from the erythropoietin gene (Semenza and Wang, 

15 1992) into pGL3-TK. The 77e-2-P-galactosidase reporter gene pT2HLacZpAH.7, containing 
10.3 kb of 5'-flanking DNA from the murine Tie-2 gene was obtained from the Cardiovascular 
Division, Beth Israel Hospital, Boston, MA. Human embryonic kidney 293 cells (ATCC 
CRL#1573) were cultured in Dulbecco's modified Eagle's medium (DMEM, low glucose; 
Gibco-BRL) supplemented with 10% fetal calf serum. The murine hepatoma cell line Hepalclc7 

20 and the c4 .ARNT deficient mutant derived from this line were maintained as described 
previously (Legraverend et al. 1982). Approximately 24 hours before transfection. cells were 
inoculated in 12-well plates at a density of 120,000 cells per well. Plasmid DNA (1-10 fig) was 
transfected into cells using a kit (MBS, Stratagene Corp., La Jolla, CA). Cells were allowed to 
recover for 3 hours at 35°C in a 3% CO : atmosphere. Where indicated, 125 ^iM CoCl 2 (#C3169, 

25 Sisrma Chem. Corp., St. Louis. MO) or 130 uM desferoxamine (#D9533, Sigma) were added 
to cells at this time and the incubation continued for an additional 16 hours in atmospheres 
containing 20% or 1% 0 2 . Luciferase and p-galactosidase enzyme activities were determined 
according to the manufacturer's instructions (Tropix, Bedford, MA). Reporter gene expression 
was normalized by cotransfection of a (3-galactosidase expression vector (pCMV-p-gal) and/or 

30 to expression obtained from the pGL3-Control plasmid (Promega Corp., Madison, WI). Levels 
of expressed c-Myc epitope-tagged EPAS1 or HIF-la were assessed by immunoblotting with 
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the anti-Myc monoclonal antibody 9E10 (Santa Cruz Biotechnology, Santa Cruz, CA) using a 
protocol supplied by the manufacturer. 
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EXAMPLES 



30 



1. 



Protocol for high throughput EPAS1-ARNT complex formation assay. 



A. 



Reagents: 
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- Mpiitmlite Avidin : 20 (ig/ml in PBS. 

- Rlnrking buffer : 5% BSA, 0.5% Tween 20 in PBS: 1 hour at room temperature. 

- Assay Buffer 100 mM KC1. 20 mM HEPES pH 7.6, I mM MgCl : . 1% glycerol, 0.5% 
NP-40, 50 mM p-mercaptoethanol, 1 mg/ml BSA. cocktail of protease inhibitors. 

-tt PFPASl protein lOx stock : 10" 8 - 10" (> M "cold" EPAS1 supplemented with 200,000- 
5 250,000 cpm of labeled EPAS1 (Beckman counter). Place in the 4°C microfridge during 
screening. 

■ Pm^ gP inhibit or retail (1 000XY . 10 mg Trypsin Inhibitor (BMB # 109894), 10 mg 
Aprotinin (BMB # 236624), 25 mg Benzamidine Mgma # B-6506), 25 mg Leupeptin (BMB # 
1017128), 10 mg APMSF (BMB # 917575), and 2;:iM NaVo 3 (Sigma # S-6508) in 10 ml of PBS. 
10 . ARNT: 10' 7 - 10 5 M biotinylated ARNT in PBS. 

B. Preparation of assay plates: 

- Coat with 120 ul of stock N-Avidin per well overnight at 4°C. 

- Wash 2 times with 200 ul PBS. 

15- - Wash 2 times with 200 fil PBS. 

C. Assay: 

- Add 40 |il assay buffer/well. 

- Add 10 jil compound or extract. 

- Add 10 jil 33 P-EPAS1 protein (20-25.000 cpm/0.1-10 pmoles/well = 10°- 10" 7 M final 

20 cone). 

- Shake at 25°C for 15 minutes. 

- Incubate additional 45 minutes at 25°C. 

- Add 40 |xl biotinylated hTFII subunit (0.1-1*0 pmoles/40 ul in assay buffer) 

- Incubate 1 hour at room temperature. 

25 - Stop the reaction by washing 4 times with 200 ul PBS. 

- Add 150 \x\ scintillation cocktail. 

- Count in Topcount. 

D. Controls for all assays (located on each plate): 
a. Non-specific binding 
30 b. Soluble (non-biotiny lated EP AS 1 ) at 80% inhibition. 
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2. Protocol for high throughput human EPAS 1/ARNT- DNA complex formation assay. 
A. Reagents: 

- Neutralite Avidin : 20 jig/ml in PBS. 

- Rlocking buffer : 5% BSA, 0.5% Tween 20 in PBS; 1 hour at room temperature. 

- Assay Buffer : 100 mM KC1, 20 mM HEPES pH 7.6, 1 mM MgCl : , 1% glycerol, 0.5% 
5 NP-40, 50 mM (3-mercaptoethanol. 1 mg/ml BSA, cocktail of protease inhibitors. 

. tt p human EPAS1 protein lOx stock : 10* 8 - 10" 6 M "cold" human EPAS1 subunit (pi 05) 
supplemented with 200,000-250.000 cpm of labeled human EPAS1 (Beckman counter). Place 
in the 4°C micro fridge during screening. 



10 Aprotinin (BMB # 236624), 25 mg Benzamidine (Sigma # B-6506), 25 mg Leupeptin (BMB # 
1017128), 10 mg APMSF (BMB # 917575), and 2mM NaVo 3 (Sigma # S-6508) in 10 ml of PBS. 
- DNA : 10 ? - 10- J MbiotinylatedDNA(SEQIDNO:3) in PBS. 
- ARNT protein : 10" 7 - 10' 5 M ARNT in PBS. 
B. Preparation of assay plates: 
15 - Coat with 120 (il of stock N- Avidin per well overnight at 4°C. 



. Protease inhibitor cocktail (1000X1 : 10 mg Trypsin Inhibitor (BMB # 109894), 10 mg 



- Wash 2 times with 200 \il PBS. 

- Block with 150 ul of blocking buffer. 



C. 



- Wash 2 times with 200 ul PBS. 
Assay: 



20 



- Add 40 [il assay buffer/well. 

- Add 10 |il compound or extract. 



25 



- Add 10 ill 33 P-h EPAS1 protein (20-25,000 cpm/0.1-10 pmoles/well =10' y - 10' 7 M final). 

- Add IOjj.1 ARNT protein. 

- Shake at 25°C for 15 minutes. 

- Incubate additional 45 minutes at 25°C. 

- Add 40 ul biotinylated DNA (0.1-10 pmoles/40 ul in assay buffer) 



- Incubate 1 hour at room temperature. 



30 



D. 



- Stop the reaction by washing 4 times with 200 ul PBS. 

- Add 150 ul scintillation cocktail. 

- Count in Topcount. 

Controls for all assays (located on each plate): 
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a. Non-specific binding 

b. Soluble (non-biotinylated EPAS1/ARNT combination) at 80% inhibition. 

All publications and patent applications cited in this specification are herein incorporated 
bv reference as if each individual publication or patent application were specifically and 
.ndividually indicated to be incorporated by reference. Although the forego.ng invention has 
been described in some detail by way of illustration and example for purposes of clarity of 
understanding it will be readily apparent to those of ordinary skill in the art in light of the 
teachings of this invention that certain changes and modifications may be made thereto without 
departing from the spirit or scope of the appended claims. 
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SEQUENCE LISTING 

{1) GENERAL INFORMATION: 

(i) APPLICANT: McKnight , Steven L. 

Russell, David W. 
Tian, Hui 

5 (ii) TITLE OF INVENTION: Endothelial PAS Domain Protein 

(iii) NUMBER OF SEQUENCES: 7 
(iv) CORRESPONDENCE ADDRESS : 

(A) ADDRESSEE: SCIENCE Sc TECHNOLOGY LAW GROUP 
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(D) STATE : CALIFORNIA 

(E) COUNTRY: USA 
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15 (A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: I3M PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS -DOS 
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20 (A) APPLICATION NUMBER: US 08/785,241 

(B) FILING DATE: 17-JAN-1997 
<C) CLASSIFICATION: 

(viii) ATTORNEY/ AGENT INFORMATION: 
(A) NAME: OSMAN, RICHARD A 
25 (B) REGISTRATION NUMBER: 36,627 

(C) REFERENCE /DOCKET NUMBER: UTSD:1229 
(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (415) 343-4341 
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30 

(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 2816 base pairs 

(B) TYPE: nucleic acid 
35 (C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
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(XI) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

„,___,-, Tr r-r-C— GCG CGCACCTCGG ACCTTCACCA CCCGCCCGGG 60 
CCTGACTGCG CGbGu^^^TC Cjobrttu.juo ^ou^^^ - ^v_^. 

„ n „ nr „^~ c-rsi-GiGGC- -"^CAGCCCC CCACCCGCCA GGGAGCCCAG GTGCTCGGCG 120 
TCTG AAC G T C TCAAAGGGCC ACAGCGACAA TGACAGCTGA CAAGGAGAAG AAAAGGAGTA 
GCTCGGAGAG GAGGAAGGAG AAGTCCCGGG ATGCTGCGCG GTGCCGGCGG AGCAAGGAGA 
5 CGGAGGTGTT CTATGAGCTG GCCCATGAGC TGCCTCTGCC CCACAGTGTG AGCTCCCATC 

TGGACAAGGC CTCCATCATG CGACTGGAAA TCAGCTTCCT GCGAACACAC AAGCTCCTCT 
CCTCAGTTTG CTCTGAAAAC GAGTCCGAAG CCGAAGCTGA CCAGCAGATG GACAACTTGT 
ACCTGAAAGC CTTGGAGGGT TTCATTGCCG TGGTGACCCA AGATGGCGAC ATGATCTTTC 
TGTCAGAAAA CATCAGCAAG TTCATGGGAC TTACACAGGT GGAGCTAACA GGACATAGTA 
10 TCTTTGACTT CACTCATCCC TGCGACCATG AGGAGATTCG TGAGAACCTG AGTCTCAAAA 

ATGGCTCTGG TTTTGGGAAA AAAAGCAAAG ACATGTCCAC AGAGCGGGAC TTCTTCATGA 
GGATGAAGTG CACGGTCACC AACAGAGGCC GTACTGTCAA CCTCAAGTCA GCCACCTGGA 
AGGTCTTGCA CTGCACGGGC CAGGTGAAAG TCTACAACAA CTGCCCTCCT CACAATAGTC 
TGTGTGGCTA CAAGGAGCCC CTGCTGTCCT GCCTCATCAT CATGTGTGAA CCAATCCAGC 
15 ACCCATCCCA CATGGACATC CCCCTGGATA GCAAGACCTT CCTGAGCCGC CACAGCATGG 

ACATGAAGTT CACCTACTGT GATGACAGAA TCACAGAACT GATTGGTTAC CACCCTGAGG 

™ *r.~isT*r*nnnr nr.arTr^ar, AACATGACCA 1020 

AGCTGCTTGG CCGCTCJALitJU" lAiuhrtULi ^w^w^ww- 

AGAGTCACCA GAACTTGTGC ACCAAGGGTC AGGTAGTAAG TGGCCAGTAC CGGATGCTCG 
CAAAGCATGG GGGCTACGTG TGGCTGGAGA CCCAGGGGAC GGTCATCTAC AACCCTCGCA 
20 ACCTGCAGCC CCAGTGCATC ATGTGTGTCA ACTACGTCCT GAGTGAGATT GAGAAGAATG 

ACGTGGTGTT CTCCATGGAC CAGACTGAAT CCCTGTTCAA GQCCCACCTG ATGGCCATGA 
ACAGCATCTT TGATAGCAGT GGCAAGGGGG CTGTGTCTGA GAAGAGTAAC TTCCTATTCA 
CCAAGCTAAA GGAGGAGCCC GAGGAGCTGG CCCAGCTGGC TCCCACCCCA GGAGACGCCA 
TCATCTCTCT GGATTTCGGG AATCAGAACT TCGAGGAGTC CTCAGCCTAT GGCAAGGCCA 
25 TCCTGCCCCC GAG CC AG CCA. TGGGCCACGG AGTTGAGGAG CCACAGCACC CAGAGCGAGG 

CTGGGAGCCT GCCTGCCTTC ACCGTGCCCC AGGCAGCTGC CCCGGGCAGC ACCACCCCCA 
GTGCCACCAG CAGCAGCAGC AGCTGCTCCA CGCCCAATAG CCCTGAAGAC TATTACACAT 
CTTTGGATAA CGACCTGAAG ATTGAAGTGA TTGAGAAGCT CTTCGCCATG GACACAGAGG 
CCAAGGACCA ATGCAGTACC CAGACGGATT TCAATGAGCT GGACTTGGAG ACACTGGCAC 
30 CCTATATCCC CATGGACGGG GAAGACTTCC AGCTAAGCCC CATCTGCCCC GAGGAGCGGC 

TCTTGGCGGA GAACCCACAG TCCACCCCCC AGCACTGCTT CAGTGCCATG ACAAACATCT 
TCCAGCCACT GGCCCCTGTA GCCCCGCACA GTCCCTTCCT CCTGGACAAG TTTCAGCAGC 
AGCTGGAGAG CAAGAAGACA GAGCCCGAGC ACCGGCCCAT GTCCTCCATC TTCTTTGATG 
CCGGAAGCAA. AGCATCCCTG CCACCGTGCT GTGGCCAGGC CAGCACCCCT CTCTCTTCCA 
35 TGGGGGGCAG ATCCAATACC CAGTGGCCCC CAGATCCACC ATTACATTTT GGGCCCACAA 

AGTGGGCCGT CGGGGATCAG CGCACAGAGT TCTTGGGAGC AGCGCCGTTG GGGCCCCCTG 
TCTCTCCACC CCATGTCTCC ACCTTCAAGA CAAGGTCTGC AAAGGGTTTT GGGGCTCGAG 

24 



ISO 
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300 

360 

420 

480 
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600 
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720 

780 

840 

900 

960 



1080 
1140 
1200 
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1320 
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1500 
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GCCCAGACGT 


GCTGAGTCCG 


GCCATGGTAG 


CCGTCTCCAA 


CAAGCTGAAG 


C i G AAGCGAC 


2280 


AGCTGGAGTA 


T G AAG AG C AA 


GCCTTCCAGG 


ACCTGAGCGG 


GGGGGACGCA 


CCTGGTGG^.rt 


2340 


G C AC C T C A C A 


TTTGATGTGG 


AAAC G G AT G A 


AGAACCTCAG 


GGGTGGGAGC 


TGCCCTTTGA 


2400 


TGCCGGACAA 


GCCACTGAGC 


GCAAATGTAC 


CCAATGATAA 


GTTCACCCAA 


AACCCCATGA 


2460 


GGGGCCTGGG 


CCATCCCCTG 


AGACATCTGC 


CGCTGCCACA 


GCGTCCATCT 


GCCATCAGTC 


2520 


CCGGGGAGAA. 


CAGCAAGAGC 


AGGTTCCGCC 


CACAGTGCTA 


CGCCACCCAG 


TACCAGGACT 


2580 


ACAGCCTGTC 


GTCAGCCCAC 


AAGGTGTCAG 


GCATGGCAAG 


CCGGCTGCTC 


GGGCCCTCAT 


2640 


TTGAGTCCTA 


CCTGCTGCCC 


GAACTGACCA 


GATATGACTG 


TGAGGTGAAC 


GTGCCCGTGC 


2700 


TGGGAAGCTC 


CACGCTCCTG 


CAAGGA.GGGG 


ACCTCCTCAG 


AGCCCTGGAC 


CAGGCCACCT 


2760 


GAGCCAGGCC 


TTCTACCTGG 


GCAGCACCTC 


TGCGGACGCC 


GTGCCACCAG 


CTTGAC 


2816 



(2) INFORMATION FOR SEQ ID MO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3031 base pairs 
{ b ) TYPE : nucleic acid 
15 (C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 
(ii) MOLECULE TYPE : cDNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 



20 



30 



35 



C G AC AG AG AG 


CTGCGGAGGG 


CC AC AG C AAA 


GAGAGCGGCT 


GCAGCCCCTA 


CGGGGTTAAG 


60 


GAACCCAGGT 


GCTCCGGGTC 


TCGGAGGGCC 


ACGGCGACAA 


TGACAGCTGA 


C AAG GAG AAA 


120 


AAAAGGAGCA 


GCTCAGAGCT 


GAGGAAGGAG 


AAATCCCGTG 


ATGCCGCGAG 


GTGCCGGCGC 


180 


AGCAAGGAGA 


CGGAGGTCTT 


CTATGAGTTG 


GCTCATGAGT 


TGCCCCTGCC 


TCACAGTGTG 


240 


AGCTCCCACC 


TGGACAAAGC 


CTCCATCATG 


CGCCTGGCCA 


TCAGCTTCCT 


TCGGACACAT 


300 


AAGCTCCTGT 


CCTCA.GTCTG 


CTCTGAAAAT 


GAATCTGAAG 


CTGAGGCCGA 


CCAGCAAATG 


360 


GATAACTTGT 


ACCTGAAAGC 


CTTGGAGGGT 


TTCATTGCTG 


TGGTGACCCA 


AGACGGTGAC 


420 


ATGATCTTTC 


TGTCGGAAAA 


CATC AG C AAG 


TTCATGGGAC 


TTACTCAGGT 


AGAACTAACA 


480 


GGACACAGCA 


TCTTTGACTT 


CACTCATCCT 


TGCGACCATG 


AAGAGATCCG 


TGAGAACCTG 


540 


ACTCTCAAAA 


ACGGCTCTGG 


TTTTGGGAAG 


AAGAGCAAAG 


ACGTGTCCAC 


CGAGCGTGAC 


600 


TTCTTCATGA 


GGATGAAGTG 


CACGGTCACC 


AACAGAGGCC 


GGACTGTCAA 


CCTCAAGTCG 


660 


GCCACCTGGA 


AGTCCGTCCT 


GCACTGCACC 


GGGCAAGTGA 


GAGTCTACAA 


C.AACTGCCCC 


720 


CCTCACAGTA 


GCCTCTGTGG 


CTCCAAGGAG 


CCCCTGCTGT 


CCTGCCTTAT 


CATCATGTGT 


780 


GAGCCAATCC 


AGCACCCATC 


CCACATGGAC 


ATCCCCCTGG 


ACAGCAAGAC 


TTTCCTGAGC 


840 


CGCCACAGCA 


TGGACATGAA 


GTTCACCTAC 


TGTGACGACA 


GAATCTTGGA 


ACTGATTGGT 


900 


TACCACCCCG 


AGGAGCTACT 


TGGACGCTCT 


GCCTATGAGT 


TTTACCATGC 


CCTGGATTCG 


960 


GAGAJVCATGA 


CCAAAAGTCA 


CCAGAACTTG 


TGCACCAAGG 


GGCAGGTGGT 


ATCTGGCCAG 


1020 


TACCGGATGC 


TAG CC AAAC A 


CGGAGGATAT 


GTGTGGCTGG 


AGACCCAGGG 


GACGGTCATC 


1080 


TACAACCCCC 


GCAACCTGCA 


GCCTCAGTGT 


ATCATGTGTG 


TCAACTATGT 


GCTGAGTGAG 


1140 
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ATCGAGAAGA ACGACGTGGT GTTCTCCATG GACCAGACCG AATCCCTGTT CAAGCCACAC 12 0 0 

CTGATGGCCA TGAACAGCAT CTTTGACAGC AGTGACGATG TGGCTGTAAC TGAGAAGAGC 12 6 0 

AACTACCTG7 TCACCAAACT GAAGGAGGAG CCCGAGGAAC TGGCCCAGTT GGCCCCCACC 1320 

CCAGGAGATG GCATTATTTC TCTCGATTTC GGAAGCCAGA ACTTCGATGA ACCCTCAGCC 13 80 

TATGGCAAGG CCATCCTTCC CCCGGGCCAG CCATGGGTCT CGGGGCTGAG GAGCCACAGT 14 4 0 

5 GCCCAGAGCG AGTCCGGGAG CCTGCCAGCC TTCACTGTGC CCCAGGCAGA CACCCCAGGG 1500 

AACACTACAC CCAGTGCTTC AAGCAGCAGT AGCTGCTCCA CGCGCAGCAG CCCTGAGGAC 1560 

TACTATTCAT CCTTGGAGAA TCCCTTGAAG ATCGAAGTGA TTGAGAAGCT TTTCGCCATG 162 0 

GACACGGAGC CGAGGGACCC GGGCAGTACC CAGACGGACT TCAGTGAACT GGATTTGGAG 1680 

ACCTTGGCAC CCTACATCCC TATGGACGGC GAGGACTTCC AGCTGAGCCC CATCTGCCCA 174 0 

10 GAGGAGCCGC TCATGCCAGA GAGCCCCCAG CCCACCCCCC AGCACTGCTT CAGTACCATG 1S0 0 

ACCAGCATCT TCCAGCCGCT CACCCCGGGG GCCACCCACG GCCCCTTCTT CCTCGATAAG 1860 

TACCGGCAGC AGTTGGAAAG CAGGAAGACA GAGTCTGAGC ACTGGCCCAT GTCTTCCATC 1920 

TTCTTTGATG CTGGGAGCAA AGGGTCCCTG TCTCCATGCT GTGGCCAGGC CAGCACCCCT 1980 

CTCTCTTCTA TGGGAGGCAG ATCCAACACG CAGTGGCCCC CGGATCCACC ATTACATTTC 2 04 0 

15 GGCCCTACTA AGTGGCCTGT GGGTGATCAG AGTGCTGAAT CCCTGGGAGC CCTGCCGGTG 2100 

GGGTCATGGC AGTTGGAACT TCCGAGCGCC CCGCTTCATG TCTCCATGTT CAAGATGAGG 2160 

. _ . ^^r^^m* /-us i*r> * nnr»7v r*r*nr* Arr^TiT nATmrrrTP, 9 9 3 0 

TCTGCAAAGG AU l lL'UUijUL LU^^oicun j. ± w*-i. ^www^wwv-* 

TCCAACAAGC TGAAGCTAAA GCGGCAGCTG GAGTATGAGG AG C AAGCCTT CCAAGACACA 22 80 

AGCGGGGGGG ACCCTCCAGG CACCAGCAGT TCACACTTGA TGTGGAAACG TATGAAGAGC - 2 3 40 

20 CTCATGGGCG GGACCTGTCC TTTGATGCCT GACAAGACCA TCAGTGCGAA CATGGCCCCC 24 00 

GATGAATTCA CCCAAAAATC TATGAGAGGC CTGGGCCAGC CACTGAGACA CCTGCCACCT 2 4 60 

CCCCAGCCAC CATCTACCAG GAGCTCAGGG GAGAACGCCA AGACTGGGTT CCCGCCACAG 2 520 

TGCTATGCCT CCCAGTTCCA GGACTACGGT CCTCCAGGAG CTCAAAAGGT GTCAGGCGTG 2 580 

GCCAGTCGAC TGCTGGGGCC ATCGTTCGAG CCTTACCTGT TGCCGGAACT G AC C AG AT AT 2 64 0 

25 GACTGTGAGG TGAACGTGCC CGTGCCTGGA AGCTCCACAC T C CTG CAGGG GAGAGACCTT 2 70 0 

CTCAGAGCTC TGGACCAGGC CACCTGAGCC AGGGCCTCTG GCCGGGCATG CCCCTGCCTG 2 76 0 

,~ rnm^^^^mnnr* * r ^ A r-nrT pru TrTHTHT THTTATTAGG TATCTCTAAC 2 82 0 

ACCAGCACAC TTCTTACGAG ATGTACTCAA CCTGGCCTAC TGGCCAGGTC ACCAAGCAGT 2 8 80 

GGCCTTTATC TGACATGCTC ACTTTATTAT CCATGTTTTA AAAATACATA GTTGTTGTAC 2 94 0 

30 CTGCTATGTT TTAC CGTTG A TGAAAGTGTT CTGAAATTTT ATAAGATTTC CCCCTCCCTC 3 00 0 

CCTCCCTTGA ATTACTTCTA ATTTATATTC C 303 1 

(2) INFORMATION FOR ""Q ID NO : 3 : 

(i) SEQUENCE CH. -.:^ACTERISTICS : 
35 (A) LENGTH: 18 base pairs 

(3} TYPE: nucleic acid 
(C) STRANDEDNESS : double 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 

(xiJ SEQUENCE DESCRIPTION : SEQ ID NO : 3 : 

18 

GCCCTACGTG CTGTCTCA 

5 (2) INFORMATION FOR SEQ ID MO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 870 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
jO (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 

Met Thr Ala Asp Lys Glu Lys Lys Arg Ser Ser Ser Glu Arg Arg Lys 

1 

Ala Ala Arg Cys Arg Arg Ser Lys Glu Thr Glu 



25 



35 



nc 30 

20 25 



15 Glu Lys Ser Arg Asp 

20 

Val Phe Tyr Glu Leu Ala His Glu Leu Pro Leu Pro His Ser Val Ser 

35 40 45 

Ser His Leu Asp Lys Ala Ser He Met Arg Leu Glu He Ser Phe Leu 

rq 60 

20 50 55 

Arg Thr His Lys Leu Leu Ser Ser Val Cys Ser Glu Asn Glu Ser Glu 



65 
Ala 



70 



75 



80 



Glu Ala Asp Gin Gin Met Asp Asn Leu Tyr Leu Lys Ala Leu Glu 



85 



90 



95 



Gly Phe He Ala Val Val Thr Gin Asp Gly Asp Met lie Phe Leu Ser 



100 



105 



110 



Glu Asn lie Ser Lys Phe Met Gly Leu Thr Gin Val Glu Leu Thr Gly 

115 I 20 125 

His Ser He Phe Asp Phe Thr His Pro Cys Asp His Glu Glu He Arg 

n r 140 

30 130 135 

Glu Asn Leu Ser Leu Lys Asn Gly Ser Gly Phe Gly Lys Lys Ser Lys 

ice; 160 
145 ISO I 55 

Asp Met Ser Thr Glu Arg Asp Phe Phe Met Arg Met Lys Cys Thr Val 
165 170 175 



Thr Asn Arg Gly Arg Thr Val Asn Leu Lys Ser Ala Thr Trp Lys Val 

180 1" 190 

Leu His cys Thr Gly Gin Val Lys Val Tyr Asn Asn Cys Pro Pro His 

27 



3NS0OCI0: <WO 9831 701 A1 _!_> 



WO 98/31701 



PCT/US98/00813 



195 200 205 

Asn Ser Leu Cys Gly Tyr Lys Glu Pro Leu Leu Ser Cys Leu lie lie 

210 215 220 

Met Cys Glu Pro lie Gin His Pro Ser His Met Asp lie Pro Leu Asp 
225 230 235 240 

5 Ser Lys Thr Phe Leu Ser Arg His Ser Met Asp Met Lys Phe Thr Tyr 

245 250 255 

Cys Asp Asp A-rg lie Thr Glu Leu lie Gly Tyr His Pro Glu Glu Leu 

260 265 270 

Leu Gly Arg Ser Ala Tyr Glu Phe Tyr His Ala Leu Asp Ser Glu Asn 
10 275 280 285 

Met Thr Lys Ser His Gin Asn Leu Cys Thr Lys Gly Gin Val Val Ser 

290 295 300 

Gly Gin Tyr Arg Met Leu Ala Lys His Gly Gly Tyr Val Trp Leu Glu 
305 310 315 320 

15 Thr Gin Gly Thr Val lie Tyr Asn Pro Arg Asn Leu Gin Pro Gin Cys 

325 330 335 

lie Met Cys Val Asn Tyr Val Leu Ser Glu lie Glu Lys Asn Asp Val 

340 345 350 

Val Phe Ser Met Asp Gin Thr Glu Ser Leu Phe Lys Pro His Leu Met: 
20 355 360 365 

Ala Met Asn Ser lie Phe Asp Ser Ser Gly Lys Gly Ala Val Ser Glu 

370 375 380 

Lys Ser Asn Phe Leu Phe Thr Lys Leu Lys Glu Glu Pro Glu Glu Leu 
385 390 395 400 

25 Ala Gin Leu Ala Pro Thr Pro Gly Asp Ala lie lie Ser Leu Asp Phe 

405 410 415 

Gly Asn Gin Asn Phe Glu Glu Ser Ser Ala Tyr Gly Lys Ala lie Leu 

420 425 430 

Pro Pro Ser Gin Pro Trp Ala Thr Glu Leu Arg Ser His Ser Thr Gin 
30 435 440 445 

Ser Glu Ala Gly Ser Leu Pro Ala Phe Thr Val Pro Gin Ala Ala Ala 

450 455 460 

Pro Gly Ser Thr Thr Pro Ser Aia Thr Ser Ser Ser Ser Ser Cys Ser 
465 470 475 480 

35 Thr Pro Asn Ser Pro Glu Asp Tyr Tyr Thr Ser Leu Asp Asn Asp Leu 

485 490 495 

Lys lie Glu Val lie Glu Lys Leu Phe Ala Met Asp Thr Glu Ala Lys 
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10 



15 



20 



30 



35 



500 



505 



5x0 



Asp Glr. Cys Se: 



Thr Gin Thr Asp Phe Asn Glu Leu As? Leu Glu Thr 
c 15 520 "5 

Leu Ala Pre Tyr lie Pro Met Asp Gly Glu As P Phe Gin Leu Ser Pro 



530 535 



540 



„ 9 Pro Glu Glu Arg Leu Leu Ala Glu Asn Pro Gin Ser Thr Pro 



cqc 560 
545 ^ *** 



He Cys Pro 
545 

Gin His Cvs Phe Ser Aia Me, Thr Asn He Phe Gin Pro Leu Ala Pro 
,1. Pro His Ser Pro Phe Leu Leu Asp Lys Phe Gin Gin Gin Leu 



Val 



580 585 



Glu ser Lys Lys Thr Glu Pro Glu His Arg Pro Met Ser Ser He Phe 
595 600 605 



Phe Asp Ala Gly Ser Lys Ala Ser Leu Pro Pro Cys Cys Gly Gin Ala 

610 

ser Pro Leu Ser Ser Met Gly Gly Arg Ser Asn Thr Gin Trp Pro 



625 



630 



635 



640 



Pro Asp Pro Pro Leu His Phe Gly Pro Thr Lys Trp Ala Val Gly Asp 



Gin Arg Thr G 



645 6S0 655 

3lu Phe Leu Gly Ala Ala Pro Leu Gly Pro Pro Val Ser 



660 



665 



670 



Pro Pro His Val Ser thr Phe Lys Thr Arg Ser Ala Lys Gly Phe Gly 

675 680 S85 

Ala Arg Gly Pro Asp Val Leu Ser Pro Ala Met Val Ala Leu Ser Asn 

690 693 
L vs Leu Lys Leu Lys Arg Gin Leu Glu Tyr Glu Glu Gin Ala Phe Gin 

7 i 5 720 

705 710 

As. Leu ser Gly Gly Asp Pro Pro Gly Gly Ser Thr Ser His Leu Met 
Trp Lys Arg Met Lys Asn Leu Arg Gly Gly Ser Cys Pro Leu Met Pro 



740 



745 



Asp Lys 



Lys Pro Leu Ser Ala Asn Val Pro Asn Asp Lys 



750 

Phe Thr Gin Asn 



755 



760 



765 



Pro Met Arg Gly Leu Gly Hi. Pro Leu Arg H,s Leu Pro Leu Pro Gin 
770 

o ro pro Ser Ala He Ser Pro Gly Glu Asn Ser Lys Ser Arg Phe Pro 



790 795 



800 

785 

Ala Thr Gin Tyr Gin Asp Tyr Ser Leu Ser Ser Ala 



Pro Gin Cys Tyr 
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805 810 815 

Kis Lys Val Ser Giy Met Ala Ser Arg Leu Leu Gly Pro Ser Phe Glu 

S20 825 830 

Ser Tyr Leu Leu Pro Glu Leu Thr Arg Tyr Asp Cys Glu Val Asn Val 

335 840 845 

Pro Val Leu Gly Ser Ser Thr Leu Leu Gin Gly Gly Asp Leu Leu Arg 

850 855 850 

Ala Leu Asp Gin Ala Thr 
365 870 

(2) INFORMATION FOR SEQ ID NO : 5 : 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 875 amino acids 
(S) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Met Thr Ala Asp Lys Glu Lys Lys Arg Ser Ser Ser Glu Leu Arg Lys 

15 10 15 

Glu Lys Ser Arg Asp Ala Ala Arg Cys Arg Arg Ser Lys Glu Thr Glu 

20 25 30 

Val Phe Tyr Glu Leu Ala His Glu Leu Pro Leu Pro His Ser Val Ser 

35 40 45 

Ser His Leu Asp Lys Ala Ser lie Met Arg Leu Ala lie Ser Phe Leu 

50 55 60 

Arg Thr His Lys Leu Leu Ser Ser Val Cys Ser Glu Asn Glu Ser Glu 
65 70 75 80 

Ala Glu Ala Asp Gin Gin Met Asp Asn Leu Tyr Leu Lys Ala Leu Glu 

85 90 95 

Gly Phe lie Ala Val Val Thr Gin Asp Gly Asp Met lie Phe Leu Ser 

100 105 110 

Glu Asn lie Ser Lys Phe Met Gly Leu Thr Gin Val Glu Leu Thr Gly 

115 120 125 

His Ser lie Phe Asp Phe Thr His Pro Cys Asp His Glu Glu He Arg 

130 135 140 

Glu Asn Leu Thr Leu Lys Asn Gly Ser Gly Phe Gly Lys Lys Ser Lys 
145 150 155 160 

30 



20 



2D 



30 
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Asp val Ser Thr Glu Arg Asp Phe ?he Met Arg Met Lys Cys Thr Val 



165" ' 170 175 



Thr Asn Arg Gly Arg Thr Val Asn Leu Lys Ser Ala Thr Trp Lys Ser 

i R C 190 

180 185 
Val Leu His Cys Thr Gly Gin Val Arg Val Tyr Asn Asn Cys Pro Pre 
195 200 205 

Ser Leu Cys Gly Ser Lys Glu Pro Leu Leu Ser Cys Leu He 



Kis Ser 
210 



215 



220 



Tlfl r1 „ H <- p r o Se^ Fis Met Asp He Pro Leu 
He Met Cys Glu Pro He Gin tus Fro 

240 

225 230 



10 Asp 



Ser Lys Thr Phe Leu Ser Arg His Ser Met Asp Met Lys Phe Thr 



245 



250 



255 



Tyr Cys Asp Asp Arg He Leu Glu Leu He Gly Tyr His Pro Glu Glu 



260 



265 



270 



Leu Leu Gly Arg Ser Ala Tyr Glu Phe Tyr His Ala Leu Asp Ser Glu 
l5 275 280 285 

Asn Met Thr Lys Ser His Gin Asn Leu Cys Thr Lys Gly Gin Val Val 

290 295 300 

Ser Gly Gin Tyr Arg Met Leu Ala Lys His Gly Gly Tyr Val Trp Leu 

320 

305 310 315 

Glu Thr Gin Gly Thr Val lie Tyr Asn Pro Arg Asn Leu Gin Pro Gin 



325 330 



Cys lie Met Cvs Val Asn Tyr Val Leu Ser Glu He Glu Lys Asn Asp 

345 350 
Val Val Phe Ser Men Asp Gin Thr Glu Ser Leu Phe Lys Pro His Leu 

360 

Met Ala Met Asn Ser He Phe Asp Ser Ser Asp Asp Val Ala Val Thr 

370 375 . 380 

Glu Lys Ser Asn Tyr Leu Phe Thr Lys Leu Lys Glu Glu Pro Glu Glu 

7 QE 400 

385 390 Ji?-> 

Leu Ala Gin Leu Ala Pro Thr Pro Gly Asp Ala lie He Ser Leu Asp 

405 410 415 

Phe Gly Ser Gin Asn Phe Asp Glu Pro Ser Ala Tyr Gly Lys Ala lie 



420 



425 430 



Leu Pro Pro Gly Gin Pro Trp Val Ser Gly Leu Arg Ser His Ser Ala 

35 435 440 445 

Gin Ser Glu Ser Gly Ser Leu Pro Ala Phe Thr Val Pro Gin Ala Asp 

450 455 460 
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Thr Pro Gly Asn Thr Thr Pro Ser Ala Ser Ser Ser Ser Ser Cys Ser 
465 470 475 480 

Thr Pro Ser Ser Pro Glu Asp Tyr Tyr Ser Ser Leu G'iu Asn Pro Leu 

485 490 495 

Lys lie Glu Val lie Glu Lys Leu Phe Ala Met Asp Thr Glu Pro Arg 
5 500 505 510 

Asp Pro Gly Ser Thr Gin Thr Asp Phe Ser Glu Leu Asp Leu Glu Thr 

515 520 525 

Leu Ala Pro Tyr lie Pro Met Asp Gly Glu Asp Phe Gin Leu Ser Pro 
530 535 540 

10 lie Cys Pro Glu Glu Pro Leu Met Pro Glu Ser Pro Gin Pro Thr Pro 

545 550 555 560 

Gin His Cys Phe Ser Thr Met Thr Ser lie Phe Gin Pro Leu Thr Pro 

565 570 575 

Gly Ala Thr His Gly Pro Phe Phe Leu Asp Lys Tyr Pro Gin Gin Leu 
15 580 585 590 

Glu Ser Arg Lys Thr Glu Ser Glu His Trp Pro Met Ser Ser lie Phe 

595 600 605 

Phe Asp Ala Gly Ser Lys Gly Ser Leu Ser Pro Cys. Cys Gly Gin Ala 
610 615 620 

20 Ser Thr Pro Leu Ser Ser Met Gly Gly Arg Ser Asn Thr Gin Trp Pro 

625 630 635 640 

Pro Asp Pro Pro Leu His Phe Gly Pro Thr Lys Trp Pro Val Gly Asp 

645 650 655 

Gin Ser Ala Glu Ser Leu Gly Ala Leu Pro Val Gly Ser Trp Gin Leu 
25 660 665 670 

Glu Leu Pro Ser Ala Pro Leu His Val Ser Met Phe Lys Met Arg Ser 

675 680 . 665 

Ala Lys Asp Phe Gly Ala Arg Gly Pro Tyr Met Met Ser Pro Ala Met 
690 695 700 

30 lie Ala Leu Ser Asn Lys Leu Lys Leu Lys Arg Gin Leu Glu Tyr Glu 

705 710 715 720 

Glu Gin Ala Phe Gin Asp Thr Ser Gly Gly Asp Pro Pro Gly Thr Ser 

725 730 , 735 

Ser Ser His Leu Met Trp Lys Arg Met Lys Ser Leu Met Gly Gly Thr 
35 740 745 750 

Cys Pro Leu Met Pro Asp Lys Thr lie Ser Ala Asn Met Ala Pro Asp 
755 760 765 
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Glu Phe Thr Gin Lys Ser Mec Arg Gly Leu Gly Gin Pro Leu Arg His 

775 7 90 
Leu Pro Pro Pro Gin Pro Pro 

785 790 
Lvs Thr Gly Phe Pro Pro Gin Cys Tyr Ala Ser Gin Phe Gin Asp Ty 



770 

Ser Thr Arg Ser Ser Gly Glu Asn Ala 
795 800 



805 8" 815 

Gly Pro Pre Gly Ala .Gin Lys Val Ser Gly Val Ala Ser Arg Leu Leu 

825 830 



320 

ly Pro Ser Phe Glu Pro Tyr Leu Leu Pro Glu Leu Thr Arg Tyr Asp 



835 



840 845 



10 Cys Glu Va 



1 Asn Val Pro Val Pro Gly Ser Ser Thr Leu Leu Gin Gly 



855 



850 

Arg As? Leu Leu Arg Ala Leu Asp Gin Ala Thr 



860 



865 



870 



875 



15 (2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 825 amino acids 
(3) TYPE: amino acid 
(C) STRANDEDNES3 : single 
9Q (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
<xi) SEQUENCE DESCRIPTION : SEQ ID NO : 6 : 
Met Glu Gly Ala Gl 
1 5 
25 Arg 



v Glv Ala Asn Asp Lys Lys Lys lie Ser Ser Glu 



10 15 



Ara Lvs Glu Lys Ser Arg Asp Ala Ala Arg Ser Arg Arg Ser Lys 



20 

;lu Ser Glu Val Phe Tyr Glu Leu Ala Hi.s Gin Leu Pro Leu Pro His 



35 



40 



45 



Asn val Ser Ser His Leu Asp Lys Ala Ser Val Met Arg Leu Thr He 

50 55 60 

Ser Tyr Leu Arg Val Arg Lys Leu Leu Asp Ala Gly Asp Leu Asp He 

7 c 80 

6 5 ^ ^ 

Glu Asp Asp Met Lys Ala Gin Mec Asn Cys Phe Tyr Leu Lys Ala" Leu 



35 Asp Gly Phe Va 



85 90 
1 net Val Leu Thr Asp Asp Gly Asp Met He Tyr He 



100 «5 "o 

,-al Asn Lys Tyr Met Gly Leu Thr Gin Phe Glu Leu Thr 



Ser Asp Asn \ 
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115 120 125 

Gly His Ser Val Phe Asp Phe Thr His Pro Cys Asp His Glu Glu Me: 

130 135 140 

Arg Glu Met Leu Thr His Arg Asn Gly Leu Val Lys Lys Gly Lys Glu 
145 150 155 160 

Gin Asn Thr Gin Arg Ser Phe Phe Leu Arg Met Lys Cys Thr Leu Thr 

165 170 175 

Ser Arg Gly Arg Thr Met Asn lie Lys Ser Ala Thr Trp Lys Val Leu 

180 1S5 190 

His Cys Thr Gly His lie His Val Tyr Asp Thr Asn Ser Asn Gin Pro 

195 200 205 

Gin Cys Gly Tyr Lys Lys Pro Pro Met Thr Cys Leu Val Leu lie Cys 

210 215 220 

Glu Pro lie Pro His Pro Ser Asn lie Glu lie Pro Leu Asp Ser Lys 
225 230 235 240 

Thr Phe Leu Ser Arg His Ser Leu Asp Met Lys Phe Ser Tyr Cys Asp 

245 250 255 

Glu Arg lie Thr Glu Leu Met Gly Tyr Glu Pro Glu Giu Leu Leu Gly 

260 265 270 

Arg Ser lie Tyr Glu Tyr Tyr His Ala Leu Asp Ser Asp His Leu Thr 

275 280 285 

Lys Thr His His Asp Met Phe Thr Lys Gly Gin Val Thr Thr Gly Gin 

290 295 300 

Tyr Arg Met Leu Ala Lys Arg Gly Gly Tyr Val Trp Val Glu Thr Gin 
305 310 315 320 

Ala Thr Val lie Tyr Asn Thr Lys Asn Ser Gin Pro Gin Cys lie Val 

325 330 335 

Cys val Asn Tyr Val Val Ser Gly lie lie Gin His Asp Leu lie Phe 

340 345 350 

Ser Leu Gin Gin Thr Glu Cys Val Leu Lys Pro Val Glu Ser Ser Asp 

355 360 365 

Met Lys Met Thr Gin Leu Phe Thr Lys Val Glu Ser Glu Asp Thr Ser 

370 375 380 

Ser Leu Phe Asp Lys Leu Lys Lys Glu Pro Asp Ala Leu Thr Leu Leu 
385 390 395 400 

Ala Pro Ala Ala Gly Asp Thr lie lie Ser Leu Asp Phe Gly Ser Asn 

405 410 415 

Asp Thr Glu Thr Asp Asp Gin Gin Leu Glu Glu Val Pro Leu Tyr Asn 

34 
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4 "? 0 4 2 5 

445 



10 



15 



20 



30 



435 440 



Ala Me 



c ser Pro Leu Pro Thr Ala Glu Thr Pro Lys Pro Leu Arg Ser 



A =q 460 
450 4d5 



Ser Ala Asp 



Pro Ala Leu Asn Gin Glu Val Ala Leu Lys Leu Glu Pro 

475 480 
465 470 45 

Asn Pro Glu Ser Leu Glu Leu Ser Phe Thr Met Pro Gin lie Gin Asp 
485 

~ w q ^ r Thr Arq Gin Ser Ser Pro Glu 
Gin Thr Pro Ser Pro Ser Asp Gly Ser Ttir Arg v* 

500 50t) 



Pro Asn Ser Pro Se 



r Glu TyrCys Phe Tyr Val Asp Ser Asp Met Val 



520 525 
515 520 



Asn Glu Phe Lys Le 
530 



u Glu Leu Val Glu Lys Leu Phe Ala Glu Asp Thr 



535 



540 



Glu Ma Lvs Asn Pro Phe Ser Thr Gin Asp Thr Asp Leu Asp Leu Glu 

550 555 560 

„" Leu Ala Pro Tyr He Pro Met Asp Asp Asp Phe Gin Leu Arg Ser 

570 575 

585 590 
Ala Ser Pro Gin Ser Thr Val Thr Val Phe Gin Gin - Thr Gin He Gin 

600 605 
Glu Pro Thr Ala Asn Ala Thr Thr Thr Thr Ala Thr Thr Asp Glu Leu 



615 620 
610 615 

Met Glu Asp He Lys 



-5 Lvs Thr val Thr Lys Asp Arg — — — = — Ile LeU Ile Ala 

630 "5 "0 

I« Pro Ser Pro Thr His Ile His Lys Glu Thr Thr Ser Ala Thr Ser 

655 

645 

Thr Gin Ser Arg Thr Ala Ser Pro Asn Arg Ala 



650 

Ser Pro Tyr Arg Asp 



665 670 
660 665 



Gly Lys Gly Val He Glu Gin Thr Glu Lys Ser His Pro Arg Ser Pro 

6 8 0 6 8 5 

Asn val Tu ser Val Ala Leu Ser Gin Arg Thr Thr Val Pro Glu Glu 
690 

Ile Leu Ala Leu Gin Asn Ala Gin Arg Lys Arg 



35 Glu Leu Asn Pro Lys 

705 

Lvs Met Glu His Asp Gly 



710 715 720 

Gly ser Leu Phe Gin Ala Val Gly He Gly Thr 

35 
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725 730 735 

Leu Leu Gin Gin Pro Asp Asp His Ala Ala Thr Thr Ser Leu Ser Trp 

740 745 750 

Lys Arg Val Lys Gly Cys Lys Ser Ser Glu Gin Asn Gly Met Glu Gin 
7=5 760 765 

5 Lys Thr He lie Leu He Pro Ser Asp Leu Ala Cys Arg Leu Leu Gly 

770 775 780 

Gin Ser Met Asp Glu Ser Gly Leu Pro Gin Leu Thr Ser Tyr Asp Cys 
785 790 795 800 

Glu Val Asn Ala Pro He Gin Gly Ser Arg Asn Leu Leu Gin Gly Glu 
10 805 810 815 

Glu Leu Leu Arg Ala Leu Asp Gin Val Asn 
820 825 

(2) INFORMATION FOR SEQ ID NO : 7 : 
15 (i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 810 amino acids 
(E) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
20 (ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 

Met Ser Ser Glu Arg Arg Lys Glu Lys Ser Arg Asp Ala Ala Arg Ser 
1 5 10 IS 

Arg Arg Thr Lys Glu Ser Glu Val Phe Tyr Glu Leu Ala His Gin Leu 
25 2 0 2 5 3 0 

Pro Leu Pro His Asn Val Ser Ser His Leu Asp Lys Ala Ser Val Met 

35 40 . 45 

Arg Leu Thr lie Ser Tyr Leu Arg Val Arg Lys Leu Leu Asp Ala Gly 
50 55 60 

30 Gly Leu Asp Ser Glu Asp Glu Met Lys Ala Gin Met Asp Cys Phe Tyr 

65 70 75 80 

Leu Lys Ala Leu Asp Gly Phe Val Met Val Leu Thr Asp Asp Gly Asp 

85 90 95 

Met Val Tyr lie Ser Asp Asn Val Asn Lys Tyr Met Gly Leu Thr Gin 
35 100 105 110 

Phe Glu Leu Ala Gly His Ser Val Phe Asp Phe Thr His Pro Cys Asp 
115 120 125 
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His Giu Glu Met 
130 

Lys Gly Lys Glu 
145 

Cys T h r Leu Thr 

Trp Lys Val Leu 
130 

Ser Asn Gin Pro 
195 

Val Leu lie Cys 
210 

Leu Asp Ser Lys 
225 

Ser Tyr Cys Asp 

Glu Leu Leu Gly 
260 

Asp His Leu Thr 
275 

Thr Thr Gly Gin 
290 

Val Glu Thr Gin 
305 

Gin Cys lie Val 

Asp Leu lie Phe 
340 

Glu Ser Ser Asp 
355 

Giu Asp Thr Ser 
370 

Leu Thr Leu Leu 
385 

Phe Gly Ser Asp 

Pro Leu Tyr Asn 
420 



Arg Glu Met Leu 
135 

Leu Asn Thr Gin 
150 

Ser Arg Gly Arc 
165 

His Cys Thr Gly 

Gin Cys Gly Tyr 
200 

Glu Pro lie Pro 
215 

Thr Phe Leu Ser 
230 

Glu Arg lie Thr 
245 

Arg Ser lie Tyr 

Lys Thr His His 
280 

Tyr Arg Met Leu 
295 

Ala Thr Val He 
310 

Cys Val Asn Tyr 
325 

Ser Leu Gin Gin 

Met Lys Met Thr 
360 

Cys Leu Phe Asp 
375 

Ala Pro Ala Ala 
390 

Asp Thr Glu Thr 
405 

Asp Val Met Phe 



Thr His Arg Asn 
140 

Arg Ser Phe Phe 
155 

Thr Met Asn He 
170 

His lie His Val 
185 

Lys Lys Pro Pro 

His Pro Ser Asn 
220 

Arg His Ser Leu 
235 

Glu Leu Met Gly 
250 

Glu Tyr Tyr His 
265 

Asp Met Phe Thr 

Ala Lys Arg Gly 
300 

Tyr Asn Thr Lys 
315 

Val Val Ser Gly 
330 

Thr Glu Ser Val 
345 

Gin Leu Phe Thr 

Lys Leu Lys Lys 
380 

Gly Asp Thr He 
395 

Glu Asp Gin Gin 
410 

Pro Ser Ser Asn 
425 

37 



Gly Pro Val Arg 

Leu Arg Met Lys 
160 

Lys Ser Ala Thr 
175 

Tyr Asp Thr Asn 
190 

Met Thr Cys Leu 
205 

He Glu He Pro 

Asp Met Lys Phe 
240 

Tyr Glu Pro Glu 
255 

Ala Leu Asp Ser 
270 

Lys Gly Gin Val 
285 

Gly Tyr Val Trp 

Asn Ser Gin Pro 
320 

He He Gin His 
335 

Leu Lys Pro Val 
350 

Lys Val Glu Ser 
365 

Giu Pro Asp Ala 

He Ser Leu Asp 
400 

Leu Glu Asp Val 
415 

Glu Lys Leu Asn 
430 
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He Asn Leu Ala Met Ser Pro Leu Pro Ser Ser Glu Thr Pro Lys Pro 



440 445 



Leu Arg Ser Ser Aia Asp Pro Ala Leu Asn Gin Glu Val Ala Leu Lys 

450 "5 460 

Leu Glu Ser Ser Pro Glu Ser Leu Gly Leu Ser Phe Thr Mec Pre Gin 

465 
I 



470 ™ "80 

lie Gin Asp Gin Pro Ala Ser Pro Ser Asp Gly Ser Thr Arg Gin Ser 



485 4 9° 495 



Ser Pro Glu Pro Asn Ser Pro Ser Glu Tyr Cys Phe Asp Val Asp Ser 
500 505 510 

10 Asp Met Val Asn Val Phe Lys Leu Glu Leu Val Glu Lys Leu Phe Ala 



515 520 525 



Glu Asp Thr Glu Ala Lys Asn Pro Phe Ser Thr Gin Asp Thr Asp Leu 

530 535 540 

Asp Leu Glu Met Leu Ala Pro Tyr He Pro Met Asp Asp Asp Phe Gin 



15 545 550 



555 560 



Leu Arg Ser Phe Asp Gin Leu Ser Pro Leu Glu Ser Asn Ser Pro Ser 

= m 575 

565 

Met Ser Thr Val Thr Gly Phe Gin Gin Thr Gin Leu Gin 



585 590 



Pro Pro Ser 

580 

20 Lys Pro Thr He Thr Ala Thr Ala Thr Thr Thr Ala Thr Thr Asp Glu 



595 



600 



605 



Ser Lys Thr Glu Thr Lys Asp Asn Lys Glu Asp lie Lys He Leu He 



610 



615 620 



Ala 



Ser Pro Ser Ser Thr Gin Val Pro Gin Glu Thr Thr Thr Ala Lys 



625 



630 



635 



640 



Ala ser Ala Tyr Ser Gly Thr His Ser Arg Thr Ala Ser Pro Asp Arg 

65.0 655 



64: 



Ala Gly Lys Arg Val He Glu Gin Thr Asp Lys Ala His Pro Arg Ser 
660 S65 670 

30 Leu Asn Leu Ser Ala Thr Leu Asn Gin Arg Asn Thr Val Pro Glu Glu 

675 66° 685 

Glu Leu Asn Pro Lys Thr lie Ala Ser Gin Asn Ala Gin Arg Lys Arg 

690 695 700 

Ly S Met Glu His Asp Gly Ser Leu Phe Gin Ala Ala Gly He Gly Thr 



710 



715 



720 



35 705 

Leu Leu Gin Gin Pro Gly Asp Cys Ala Pro Thr Met Ser Leu Ser 

735 



725 730 
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Lys Arg Val Lys Gly ?he He Ser 
740 

Lys Thr lie lie Leu lie Pro Ser 
755 760 
Gin Ser Met Asp Vai Ser Gly Leu 
5 770 775 

Glu Val Asn Ala Pro lie Gin Gly 
785 7 ^0 
Glu Leu Leu Arg Ala Leu Asp Gin 
805 

10 



Ser Glu Gin Asn Gly Thr Glu Gin 
745 7 50 
Asp Leu Ala Cys Arg Leu Leu Gly 
765 

Pro Gin Leu Thr Ser Tyr Asp Cys 
780 

Ser Arg Asn Leu Leu Gin Gly Glu 
795 800 

Val Asn 
810 



39 
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WHAT IS CLAIMED IS: 

1. An isolated protein comprising a endothelial PAS domain protein 1 (EPAS1) protein 
(SEQ ID NO: 4 or 5), or an EPAS1 protein domain thereof having at least 14 consecutive amino 
acids of SEQ ID NO: 4 or 5 and an EPAS 1 -specific activity. 

2. .An isolated protein according to claim 1 , wherein said protein specifically binds at least 
one of a bHLH/PAS protein, a heat shock protein, or a nucleic acid consisting of SEQ ID NO:3. 

3. A recombinant nucleic acid encoding a protein according to claim 1 . 

4. A cell comprising a nucleic acid according to claim 3. 

5. A method of making an isolated EPAS1 protein, comprising steps: introducing a nucleic 
acid according to claim 3 into a host cell or cellular extract, incubating said host cell or extract 
under conditions whereby said nucleic acid is expressed as a transcript and said transcript is 
expressed as a translation product comprising said protein, and isolating said translation product. 

6. An isolated EP AS1 protein made by the method of claim 5. 

7. An isolated EPAS 1 nucleic acid comprising SEQ ID NO: 1 or 2, or a fragment thereof 
having at least 24 consecutive bases of SEQ ID NO: 1 or 2 and sufficient to specifically hybridize 
with a nucleic acid having the sequence defined by the corresponding SEQ ID NO: 1 or 2 in the 
presence of human or murine genomic DNA, respectively. 

8. .An isolated EPAS 1 nucleic acid according to claim 7, said nucleic acid comprising SEQ 
ID NO:l. or a fragment thereof having at least 24 consecutive bases of SEQ ID NO:I and 
sufficient to specifically hybridize with a nucleic acid having the sequence defined by SEQ ID 
NO:l in the presence of human genomic DNA. 

9. ' An isolated EPAS 1 nucleic acid according to claim 7 ? said nucleic acid comprising SEQ 
ID NO:2, or a fragment thereof having at least 24 consecutive bases of SEQ ID NO:2 and 
sufficient to specifically hybridize with a nucleic acid having the sequence defined by SEQ ID 
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NO:2 in the presence of murine genomic DNA. 



10. A method of screening for an agent which modulates the binding of a EPAS1 protein to 
a binding target, said method comprising the steps of: 
incubating a mixture comprising: 
5 an isolated protein according to claim 1, 

a binding target of said protein, and 
a candidate agent; 

under conditions whereby, but for the presence of said agent, said protein specifically 
binds said binding target at a reference affinity; 
10 detecting the binding affinity of said protein to said binding target to determine an agent- 

biased affinity, 

wherein a difference between the agent-biased affinity and the reference affinity indicates 
that said agent modulates the binding of said protein to said binding target. 

15 11. A method according to claim 10, wherein said binding target is a one of a bHLH/PAS 
protein, a heat shock protein, or a nucleic acid consisting of SEQ ID NO:3. 



BNSDOCID: <WO 9831 701 Ai _l_> 



41 



INTERNATIONAL SEARCH REPORT 



International application No. 
PCT/US98/O0813 



A. CLASSIFICATION OF SUBJECT MATTER 
IPC(6) : Please Sec Extra Sheet. 

US CL :435/6, 7.1. 69.1. 252.3. 325; 530/350; 536/23.5 . 24.31. 24.33 
According to International Patent Classification (IPC) or to both national classification and IPC 



FIELDS SEARCHED 



Minimum documentation searched (classification system followed by classification symbols) 
U.S. : 435/6. 7.1. 69.1. 252.3, 325; 530/350; 536/23.5. 24.31. 24.33 



Documentation searched other than minimum 



documentation to the extent that such documents are included in the fields searched 



Electronic data base consulted during the international search (name of data base and. where practicable, search terms used) 
Please See Extra Sheet. 



DOCUMENTS CONSIDERED TO BE RELEVANT 



Category* 



x 

Y 



Citation of document, with indication, where appropriate, of the relevant passages 



TIAN et ai. Endothelial PAS domain protein 1 (EPAS1), a 
transcription factor selectively expressed in endothelial cells. Genes 
Dev. 01 January 1997, Vol. 11, No. 1. pages 72-82, see entire 
document. 



Relevant to claim No. 



1-9 

10. 11 



| | Further documents are listed in the continuation of Box C. | | See patent family annex. 



Special categories of cited documents: 

document defining the general iuie of the art which ii not considered 
to be of particular relevance 

earlier document published on or after the international filing date 

document which may throw doubt* on priority claim(s) or which is 
cited to establiah the publicauon date of another citation or other 
special reason (as specified) 

document referring to an oral disclosure, use. exhibition or other 
mean* 

document published prior to the international filing date but later than 
the priority date claimed 



i.ier document published after the international filing date or priority 
date and not in conflict with the application but cited to understand 
the principle or theory underlying the invention 

document of particular relevance; the claimed invention cannot be 
considered novel or cannot be considered to involve an inventive step 
when the document is taken alone 

document of particular relevance, the claimed invenuon cannot be 
considered to involve an inventive step when the document is 
combined with one or more other such documents, such combination 
being obvious to a person skilled in the art 



"A." document member of the same patent family 



Date of the actual completion of the international search 



27 MARCH 1998 



Name and mailing address of the ISA/US 
Commissioner of Patents and Trademarks 
Box PCT 

Washington, D.C. 20231 
Facsimile No. (703) 305-3230 



Date of mailing of the international search report 




ROBfcR^CHWARTZMAN 

Telephone No. (703) 308-0196 



Form PCT/ISA/210 (second sheet)(July 1992)* 

BNSDOCIl-. <WO 9831701A1_I_> 





INTERNATIONAL SEARCH REPORT 



International application No. 



PCT/US98/00813 



A. CLASSIFICATION OF SUBJECT MATTER: 

IPC (6): 

C07H 21/04; C07K 14/00; C12N 1/21. 5/10, 15/12; C12P 21/00; C12Q 1/68; G01N 33/53 

B. FIELDS SEARCHED 

Electronic data bases consulted (Name of data base and where practicable terms used): 
STN: Medline, Biosis, Embase, CAPlus, WP1DS, JAPIO, PATOSEP, PATOSWO 



APS 

Search terms: EPAS1, endothelial, PAS domain 



Form PCT/ISA/210 (extra shect)(July 1992)* 



BNS0OCI0: <WO 9631701A1 J_> 



